Goto

Collaborating Authors

 continuous time analysis and application


Fictitious Play for Mean Field Games: Continuous Time Analysis and Applications

Neural Information Processing Systems

In this paper, we deepen the analysis of continuous time Fictitious Play learning algorithm to the consideration of various finite state Mean Field Game settings (finite horizon, $\gamma$-discounted), allowing in particular for the introduction of an additional common noise. We first present a theoretical convergence analysis of the continuous time Fictitious Play process and prove that the induced exploitability decreases at a rate $O(\frac{1}{t})$. Such analysis emphasizes the use of exploitability as a relevant metric for evaluating the convergence towards a Nash equilibrium in the context of Mean Field Games. These theoretical contributions are supported by numerical experiments provided in either model-based or model-free settings. We provide hereby for the first time converging learning dynamics for Mean Field Games in the presence of common noise.


Review for NeurIPS paper: Fictitious Play for Mean Field Games: Continuous Time Analysis and Applications

Neural Information Processing Systems

Weaknesses: The major weaknesses of this paper are listed here. More details can be found below. The presentation of the CTFP theory is confusing and inconsistent. In particular, Section 3 needs some substantial improvement. And although some ingredients of the algorithms are written with more details in the appendix, it's still unclear how each of the algorithm work as a whole.


Fictitious Play for Mean Field Games: Continuous Time Analysis and Applications

Neural Information Processing Systems

In this paper, we deepen the analysis of continuous time Fictitious Play learning algorithm to the consideration of various finite state Mean Field Game settings (finite horizon, \gamma -discounted), allowing in particular for the introduction of an additional common noise. We first present a theoretical convergence analysis of the continuous time Fictitious Play process and prove that the induced exploitability decreases at a rate O(\frac{1}{t}) . Such analysis emphasizes the use of exploitability as a relevant metric for evaluating the convergence towards a Nash equilibrium in the context of Mean Field Games. These theoretical contributions are supported by numerical experiments provided in either model-based or model-free settings. We provide hereby for the first time converging learning dynamics for Mean Field Games in the presence of common noise.